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ZERO-CROSSINGS  ANALYSIS 


by 

Benjamin  Kedem 

Department  of  Mathematics,  University  of  Maryland,  College  Park 

ABSTRACT 

We  advance  a  coherent  development  of  zero-crossing  based 
methods  and  theory  appropriate  for  fast  signal  analysis.  Quite  a 
few  ideas  pertaining  to  zero-crossing  counts  found  in  the  litera¬ 
ture  can  be  expressed  and  interpreted  with  the  help  of  this  more 
general  setup.  A  central  issue  addressed  in  some  detail  is  the 
fruitful  connection  which  exists  between  zero-crossing  counts  and 
linear  filtering.  This  connection  is  explored  and  interpreted  with 
the  help  of  a  certain  zero-crossing  spectral  representation,  and 
is  then  applied  in  spectral  analysis,  detection  and  discrimination. 
Zero-crossing  counts  in  filtered  time  series  are  called  higher 
order  crossings.  The  theme  of  this  work  is  that  higher  order 
crossings  analysis  provides  a  useful  descriptive  as  well  as  ana¬ 
lytical  tool  that  can  in  many  respects  match  spectral  analysis. 

To  a  great  extent  these  two  types  of  analysis  are  in  fact  equiva¬ 
lent,  but  each  emphasizes  a  different  point  of  view.  Advantages 
offered  by  higher  order  crossings  are  great  simplicity  and  a 
drastic  data  reduction. 
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ZERO-CROSSINGS  ANALYSIS 


Benjamin  Kedem 


1.  INTRODUCTION 

We  shall  be  concerned  in  this  paper  with  developing  a  certain 
systematic  approach  to  the  analysis  of  random  signals  based  on  zero¬ 
crossing  counts.  The  need  for  such  a  development  stems  from  the 
interest  in  zero-crossing  based  methods  and  techniques  in  fields 
such  as  signal  processing  [1],  [2],  [3],  fluid  mechanics  [  4  ] ,  [  5  ] , 
speech  processing  [6],  [7],  biomedical  engineering  [8],  optics  [9], 
[10],  neurophysiology  [11],  structural  dynamics  [12],  [13],  communi¬ 
cations  [14],  [15],  and  image  processing  [16],  [17].  It  is  felt  that 
a  more  general  setup  can  provide  a  better  understanding  of  these  tech¬ 
niques  and  of  their  outcomes,  and  furthermore,  suggests  Interesting 
and  useful  new  methods  of  analysis. 

The  first  serious  attempt  to  study  properties  of  zero-crossings 
is  the  pioneering  work  of  Kac  [18]  and  Rice  [19]  who  were  mainly  in¬ 
terested  in  moment  and  distribution  problems  pertaining  to  zero¬ 
crossing  counts.  This  work  and  related  level  crossing  problems  ^ 

are  discussed  and  reviewed  in  [2],  [20],  [21],  [22],  [23],  [24], 
where  numerous  additional  references  can  be  found.  None  of  this 
voluminous  work,  though,  will  be  reproduced  here. 

There  is  a  great  deal  more  to  zero-crossings  other  than  the 
classical  distribution  and  moment  problems,  and  we  shall  be  con¬ 
cerned  with  somewhat  more  general  properties  of  zero-crossing 
counts  and  zero-crossing  based  methods  of  analysis.  In  particular. 
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the  present  work  explores  the  fruitful  connection  which  exists  be¬ 
tween  zero-crossing  counts  and  time  invariant  linear  filtering. 

As  we  shall  wee,  this  connection  sheds  light  on  interesting  and 
surprisingly  useful  properties  of  zero-crossings  appropriate  for 
the  fast  analysis  of  random  signals.  Furthermore,  it  will  become 
evident  that  zero-crossing  counts  in  random  signals  and  their 
filtered  versions  essentially  constitute  a  domain  by  itself  which 
in  many  respects  is  equivalent  to  the  spectral  domain.  This  will 
be  demonstrated  and  illustrated  by  a  fair  number  of  examples,  using 
real  and  artificial  data. 

To  motivate  some  key  ideas  presented  in  this  paper,  it  is  in¬ 
structive  and  helpful  at  this  early  stage  to  consider  an  example 
[25],  [26],  [27]  which  is  most  illuminating.  Figure  1  presents 
the  graphs  of  two  signals  in  discrete  time,  t = 1 ,2 , . . . , 100,  each 
made  of  a  superposition  of  two  finite  sinusoids  with  angular  fre¬ 
quencies  0.1  and  3.  As  a  general  rule  we  adhere  to  discrete  time 
which  is  more  convenient  for  machine  calculations.  Consider  the 
first  graph  (a)  and  suppose  it  is  desired  to  estimate  the  highest 
frequency  which  in  this  case  is  3.  Let  and  denote  the  num¬ 
ber  of  zero-crossings  and  the  number  of  local  minima  and  maxima, 
respectively,  when  the  discrete  time  points  are  connected  by  line 
segments.  Since  the  low  frequency  component  in  (a)  is  obviously 
dominant,  ~  ^  small  for  serving  as  a  reasonable  estimate. 

On  the  other  hand,  =  92  gives  a  good  deal  more  information 
about  the  periodicity  of  the  high  frequency  component  as  it  better 
captures  the  component's  oscillation  characteristics.  Indeed,  by 
using  proper  normalization,  our  estimate  for  the  highest  frequency 


3 


is  ir*  92/99  -  2.9  which  is  quite  reasonable.  Now  D2,  ignoring  end 
effects,  is  the  number  of  zero-crossings  in  the  differenced  signal 
and  a  difference  is  a  high-pass  filter  which  amplifies  high  fre¬ 
quency  components.  Thus,  the  crucial  point  is  that  in  this  case  we 
must  render  the  high  frequency  component  dominant  by  a  filter  in 
order  to  be  able  to  estimate  its  frequency  from  zero-crossings.  On 
the  other  hand,  in  the  second  case  (b)  the  high  frequency  component 
is  already  dominant  and  =  95  yields  it*  95/99  =3  as  expected. 
Note  that  in  this  second  case  =  94  is  very  close  to  which  is 
an  indication  [28],  [29]  of  the  presence  of  a  strong  periodic  com¬ 
ponent.  It  is  seen  that  for  the  cases  considered  the  pair  (D^.D^) 
provides  useful  spectral  Information. 

Another  observation  pertains  to  the  discrimination  potential 
of  (Dj^,D2).  The  two  graphs  appear  to  be  obviously  different  and 
this  difference  in  appearance  results  in  different  pairs  (D^.D^). 
This  shows  that  differences  in  signals  can  be  expressed  very 
economically  by  zero-crossings  observed  in  the  signal  and  in 
functions  thereof. 

In  what  follows,  these  ideas  will  be  developed  within  a  con¬ 
venient  framework  which  is  very  effective  in  describing  the  domain 
of  zero-crossings. 

The  paper  is  divided  into  four  parts.  The  first,  consisting 
of  Section  2,  introduces  some  basic  notions  and  terminology  of 
stationary  random  processes.  The  second  (3.1 -3.4)  discusses 
general  properties  of  zero-crossings.  As  a  matter  of  convenience 
important  points  and  facts  are  summarized  at  times  in  the  form  of 
theorems.  We  do  not  prove  these  theorems  but  give  intuitive 
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arguments  instead  of  formal  proofs.  The  third  part  (4. 1-4. 5)  dis¬ 
cusses  spectral  analysis  by  zero-crossings,  and  the  fourth  (5.1  -5.5) 
is  devoted  to  discrimination  between  time  series.  The  paper  con¬ 
tains  quite  a  few  examples  and  illustrations  which  enhance  its 
tutorial  nature. 


2.  SOME  PRELIMINARIES  AND  NOTATION 


A  random  signal  in  discrete  time  is  commonly  referred  to  as  a 
stochastic  or  random  process.  Without  loss  of  generality  we  let 
"discrete  time"  be  the  set  of  integers  and  denote  the  random 
process  by  {Z^},  t =  . . . , -1, 0, 1 , . . . ,  where  for  each  fixed  t,  is 
a  random  quantity.  Examples  of  such  random  processes  are  any  elec¬ 
tronic  noise  sampled  at  integral  time  points  and  random  digit 
streams.  E  will  denote  mathematical  expectation  or  mean.  In  the 
context  of  random  processes  E  stands  for  ensemble  average.  Many 
of  the  ideas  which  we  would  like  to  bring  up  are  best  explained  if 
we  make  the  simplifying  assumption  that  {Z^.}  is  a  real  stationary 
Gaussian  random  process  with  mean  zero.  By  this  we  mean  that 
"statistics"  do  not  change  in  time  and  that  the  joint  probability 

distribution  of  any  finite  vector  (Z  ,...,Z  )  has  a  multivariate 

1  k 

normal  distribution.  In  this  case  for  each  fixed  t>  has  the 
usual  Gaussian  distribution.  There  are  a  number  of  very  good 
references  [21],  [30].  [311»  [32],  [33],  [3A]  which  cover  the  theory  and 
applications  of  stationary  processes  and  to  which  the  reader  is 
referred  for  complete  treatment.  Here,  however,  we  summarize  very 
briefly  some  basic  notions  needed  in  the  sequel. 

A  stationary  random  process  is  characterized  by  the  require¬ 
ment  that  the  mean  EZ^  ^  U  be  a  constant  and  that  the  covariance 
f unct ion 

is  a  function  of  tlie  lag  k  only.  Tiien  the  celebrated  Wiener-Khint- 
chine  Theorem  [33  I  .  p.  73,  states  that  there  exists  a  monotone 


increasing  function  F(ijj)  such  that 


Y, 


cos(k.a))dF(u)) 


'-t; 

where  tt  is  the  highest  possible  angular  frequency  due  to  the  dis¬ 
creteness  of  t.  F  is  called  the  spectral  distribution  function  of 
the  random  process.  F(t:)  is  equal  to  the  variance  (total  power) 
of  since 


dF  (oj)  . 


•'-'IT 

Except  for  a  constant,  F  behaves  like  a  cumulative 
tribution  function  and  therefore  can  be  decomposed, 
cal  purposes,  into  two  components 


probability  dis- 
for  all  practi- 


F(w)  =  Fj(w)  + 


F^  is  a  nondecreasing  continuous  function  determined  by  a  nonnega¬ 
tive  symmetric  spectral  density,  f,  by  the  relation 


(u.) 


f(A)dA. 


F^  is  a  nondecreasing  step  function  determined  by  a  symmetric  spec¬ 
tral  function,  q,  by  the  relation 


F_(aO  =  [  q(A.) 

A .  <u)  J 

,1 


where  A,  (=  A  .)  is  a  discrete  set  of  frequencies.  The  spectral 


mass  or  power  associated  with  a  set  of  frequencies  A  is  determined 


from  the  continuous  and  discrete  components  by  the  expression  (33 | 


I  dF(a))  =  I  q(\.)  -t  f  f(A)dA. 

k  A.cA  '  k 
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That  is,  both  continuous  and  discrete  components  contribute  to  the 
power.  The  spectrum  is  said  to  be  continuous,  discrete  or  of 
mixed  type  according  to  the  three  cases  (F^  ^  0,  =  0) , 

(F^  E  0,  F^  ^  0)  or  (F^  ^  0,  F.^  ^  0) ,  the  last  one  being  the  most 
general  case.  An  important  example  of  a  process  with  a  mixed 
spectrum  is  furnished  by  a  trigonometric  polynomial  with  random 
amplitudes  plus  noise.  This  type  of  process  is  useful  for  our 
purpose . 

By  a  time  Invariant  linear  filter  [33],  [35]  we  mean  a  linear 
operation  L  on  [Z^],  of  the  form 

''c  =  -  Vt  *  Vt-l  Vt-2 

which  produces  a  new  stationary  process  ^he  transfer  func¬ 

tion  H(uj)  is  defined  by  the  sum 

H(M)  =  I  h^e-'"”^. 
r 

1h(w)|  is  called  the  gain  associated  with  the  filter 

When  a  time  invariant  linear  filter  with  transfer  function  H 
is  applied  to  a  stationary  process  the  resulting  process  is 

also  stationary  provided  the  matching  condition 

|H(\);^dF(A)  <  <-o 

is  satisfied.  In  this  case  the  output  spectral  density  and  spec- 

2  2 

tral  function  are  given  by  |H(A)|  f(A)  and  |H(A)|  q(\),  -ti<A<[i, 
respectively. 

In  many  cases  it  is  convenient  to  normalize  by  and  de¬ 
fine  a  new  nuantitv  c.  bv 
■  '  K 


p,  = 


k  =  0,±l. 


p,  is  called  the  correlation  or  autocorrelation  function  and  satis 
k 

fies  I P,  I  <  1. 
k 

One  last  point,  any  finite  realization  of  a  random  process, 
Z^,Z^, .  .  .  ,7.^,  is  called  a  time  series.  Througliout  most  of  the 
paper,  we  deal  with  time  series  of  length  N'  from  stationary 
Gaussian  random  processes  with  zero  mean.  The  only  exceptions  are 


bounded  processes  discussed  very  briefly  at  the  end. 


3.  SOME  GENERAL  PROPERTIES  OF  ZERO- CROSSINGS 

In  this  section  zero  crossing  counts  in  discrete  time  are 
defined  and  are  shown  to  possess  a  certain  spectral  representation 
which  shows  their  intriguing  connection  with  the  spectrum  and 
linear  filtering.  The  representation  also  illustrates  the  funda¬ 
mental  principle  that  the  zero-crossing  rate  tends  to  admit  values 
in  a  neighborhood  of  a  dominant  frequency.  This  principle  is  the 
key  factor  in  many  of  our  results. 


3.1.  A  Zero-crossing  Spectral  Representation 

Let  Z,,...,Z.,  be  a  zero  mean  stationary  Gaussian  time  series. 
IN 


To  define  what  is  meant  by  zero-crossings  in  discrete  time  we  con¬ 
sider  the  associated  clipped  binary  series  defined  by 


1,  Zj.  >  0 


0,  Z^  <  0 


t  =  1 . N, 


and  let  d^  be  the  indicator  function  at  time  t. 


d  =  X  +  X  -  2X  X 
t  t  t-1  t  t-1 


Then  d^  is  0  or  1.  When  =  I  we  say  that  a  zero-crossing 

occurs  at  time  t.  The  number  of  zero-crossings  in  Zp...,Zj^  is 
denoted  by  1)^  and  is  defined  bv  the  sum 


",  =  Jj  +  ...  t  d^. 


As  an  illustration  consider  the  time  series  t  =1,...,]2,  and 

its  associated  two  binary  series  X^ ,  d^ ,  t=  1,...,12  in  Figure  2. 


We  ht 


ave  Dj  =  d^  +  . . .  +  d^^  =  b,  and  it  is  seen  that  the  number 
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Figure  2.  A  time  series  of  size  12  with  6  zero-crossings. 


of  zero-crossings  is  precisely  the  number  of  symbol  changes  in  the 
X  series.  As  a  digression  we  remark  that  the  number  of  symbol 
changes  is  a  so-called  sufficient  statistic  for  stationary  binary 
Markov  chains  and  as  such  this  number  carries  information  about  the 
chains'  parameters  [36]. 

In  the  classical  theory  of  zero-crossings  in  continuous  time 
the  expected  number  of  zero-crossings  per  unit  time  is  a  useful 
quantity  [19],  [  2  ],  and  it  is  interesting  to  see  where  ED^ ,  the 
expected  number  of  zero-crossings,  leads  us  in  the  present  case 
of  discrete  time. 

Since  is  Gaussian  with  zero  mean,  EX^  =  1/2  and  (37] 


EX  X  , 
t  t-1 


1  .  1 


which  gives  ED  = 


1  1 


-1 


2  "  n  ^'l 
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and  tt)erefore 


ED^  =  (N-l)(|  -  sin  ^D^). 


-  V-' .  'v  -j 


. 


2 


By  rearranging  terms  we  obtain  the  basic  formula  [36] 


(  \ 

Pi  =  N-rj* 


(1) 


But  from  the  definition  of  and  by  the  Wiener-Khintchine  Theorem 


follows  the  relation 


/•rrED, 

[-Y- 


°i  ]  _ 


f 

J-TT 


cos(u))dF((ji)) 


(2) 


dF(w) 


-Tr 


to  which  we  refer  as  the  zer>o-cTK>ssing  spectral  representation  [27], 
and  one  which  plays  an  important  role  in  our  development.  In  par¬ 


ticular,  suppose  is  operated  on  by  a  linear  filter  L  with 


transfer  function  H.  Then  the  output  is  again  Gaussian  with  mean 


zero  and  its  spectrum  is  given  by  |H(u))j  dF(oj).  Let  denote  the 


number  of  zero-crossings  in  the  filtered  series  L(Zj, L(Z^) . 


Then  (2)  Implies  the  representation 

/•TT 


\  N-L 


COS 


cos(a))  H(w))  dF(u>) 


-IT 


(3) 


H(to)  dF(a)) 


-71 


(3)  connects  zero-crossings  and  linear  filtering.  Clearly,  when  L 
is  the  identity  (or  do  nothing)  filter  then  H  =  1  and  (3)  re¬ 
duces  to  (2)  as  it  should.  We  can  now  extend  (3)  to  any  sequential 
filtering.  Thus  the  sequential  filter  L^L^. . . with  transfer 
function  generalization 


3 


1 


'ttED, 


cos 


H, .  .  .H,  1 
1  k 


N-1 


r 

COS 

■'-TT 

1 

J 

-IT 

cos((*))  |Hj^(u))  I  ^. .  .  I  Hj^(w)  I  ^dF(u)) 


(4 


H^(a))j  ...  |H,Jco)rdF(u)) 


where  ^  is  the  zero-crossing  count  in  the  sequentially 

1' '  ‘  k 

filtered  time  series.  In  subsequent  sections  some  interesting 
properties  and  corollaries  of  (3),  (4)  are  studied  and  discussed 
in  some  detail.  Of  central  importance  to  us  is  the  degree  to  which 
zero-crossings  before  and  after  filtering  determine  F. 

The  idea  Lliat  zeros  of  filtered  signals  are  of  relevance  in 
applications  is  not  entirely  new.  For  example,  in  the  continuous 
time  case  in  order  to  determine  the  expected  number  of  extrema  per 
unit  time  we  simply  find  the  expected  number  of  zeros  per  unit 
time  in  the  derivative  of  the  random  signal  [19].  Upcrossings  of 
differenced  time  series  are  discussed  in  [38]  while  in  [39]  a 
filter  is  applied  prior  to  some  counting  procedures.  The  fact 
that  an  application  of  a  filter  affects  the  zero-crossing  count 
is  also  recognized  in  [19]-  (3)  was  introduced  in  [27],  [40]  for 

the  purpose  of  defection  of  periodicities  in  time  series.  The  con¬ 
tinuous  time  analog  of  (2),  (3)  is  known  <is  Rice's  formula  |19l. 
Another  analog  for  a  non-Claussian  case  is  given  in  [41  |. 


3.2.  The  Dominant  Frequency  Principle 

When  F(io)  is  continuou;  at  the  origin,  an  assumption  we  shall 
adopt  throughout  the  paper,  symmetry  implies  that  (2)  can  be  ex¬ 
pressed  in  terms  of  positive  frequencies  as  in  the  expression 


/ITT’ 11  \ 


I  OS  (10)  tIF  (id) 


C 


5 
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This  shows  clearly  that  the  normalized  expected  zero-crossing  rate 
■TrEDj^/(N-l)  is  a  weighted  average  of  the  spectral  mass.  Therefore, 
when  a  certain  frequency  band  becomes  dominant,  i.e.,  carries  more 
power  than  other  bands,  it  attracts  the  normalized  expected  zero- 
crossings  and  7tED^/(N-1)  admits  values  in  this  band.  Likewise 
when  a  certain  frequency  say,  becomes  significantly  dominant 
then  (5)  implies  that  ttED^^/CN-I)  -  0)^.  In  the  extreme  case  when 
only  uJq  is  present,  that  is,  for  oi  e  [0,7t] 


FCclH-)  -  F(a)-)  >  0, 

=  0, 


U)  =  OJ, 


we  have  the  equality 


ttEDj^ 

N-1 


O), 


Thus,  replacing  ED^  simply  by  ,  we  can  see  that  when  a  certain 
frequency  becomes  dominant  the  quantity  nDj^/(N-l)  will  land  at  or 
near  this  frequency.  In  other  words,  a  dominant  frequency,  when 
it  exists  in  the  spectrum,  can  be  quickly  detected  by  zero- 
crossings  only.  Now,  when  a  time  invariant  linear  filter  is 
applied  to  a  stationary  time  series  it  modifies  the  spectral 
weiglit  given  to  the  frequencies  in  tiie  range  [O.tt),  emphasizing 
some  bands  while  attenuating  ntliers.  Consequently,  when  a  dis¬ 
crete  frequency  exists  in  the  spectrum  it  can  be  enhanced  by  a 
filter  and  then  estimated  by  ttD  ,/(N-l).  This  tendency  of  zero- 
crossings  (after  proper  scaling)  to  admit  values  in  a  neighborhood 
of  a  significantly  dominant  frequency  will  be  called  the  dominant 
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frequency  principle.  This  principle  can  be  recognized  as  the  basis 
of  many  of  our  theoretical  results.  See  also  [Al],  [42]. 

To  illustrate  this  principle  and  its  application  in  fast  fre¬ 
quency  detection  consider  the  superposition 

=  Acos(0.8t)  +  Bcos(1.25t),  t  =  l,...,200.  (6) 

Realizations  of  this  series  before  and  after  filtering  are  plotted 
in  Figure  3.  The  graphs  were  scaled  but  this  has  no  effect  on 
zero-crossings!  From  (2)  or  equivalently  (5)  we  know  that  is  a 
function  of  the  weights  A,  B.  When  A  =  B  =  1,  no  frequency  is 
dominant.  We  obtain  from  Figure  3  (a)  =  67  and 

TrDj^/199  =  1.057  which  is  between  0.8  and  1.25  as  expected.  When 
A  =  0.8  and  B  =  1,  the  frequency  1.25  is  dominant.  From  Figure  3 
(b)  we  obtain  =  79  and  ttDj/199  =  1.247  is  very  close  to  the 
dominant  frequency  in  agreement  with  the  dominant  frequency  prin¬ 
ciple.  By  operating  on  this  second  series  with  a  low-pass  filter 
we  can  render  the  frequency  0.8  dominant.  Figure  3  (c)  gives  the 
scaled  graph  of  the  series  in  (b)  after  it  was  operated  on  with  a 
low-pass  linear  filter  with  transfer  function  (1  +  e  )^.  Now 

the  frequency  0.8  is  dominant,  D  =  51  and  7rD^,/199  =  0.805 

Hi  Hi 

as  expected. 

At  this  point  the  reader  may  be  puzzled  by  the  fact  that  (6) 
does  not  at  all  look  random  while  our  theory  pertains  to  random 
time  series.  This  ambiguity,  should  it  arise,  will  be  resolved  in 


Section  4. 1. 
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D,  =  67,  N  =  200 


(a) 


ttD 

= cos(0. 8t)  + cos(l. 25t) .  Neither  frequency  is  dominant.  —  =  1.057. 


Figure  3.  Demonstration  ol  the  dominant  trequency  principle.  Zero¬ 
crossing  ((Mints  detect  dorriinint  t  re(|  ueiic' i  es .  All  grajdis  are  scaled. 


3.3.  Zero-crossings  of  Repeatedly  Differenced  Series 


A  useful  filter  associated  with  zero-crossings  is  the  differ¬ 
ence  operator.  To  introduce  this  operator  it  is  convenient  to  de¬ 
fine  the  shift  operator  B  as 


BZ 


t 


Z 


t-1 


with  transfer  function  e  The  difference  operator  V  is  defined 

by  V  =  1  -  B  with  transfer  function  1  -  e  Then  the  first 

difference  of  [Z  }  is 
t 


VZ  S  (1  -  B)Z  =  Z 
t  t  t 


-  Z 


t-1 


and  the  second  difference  is 


V^Zj.  =  V(VZ^)  c  (l-B)^Zj_  =  Zj.  -  2Z^_j  +  Z^_2 


In  general  the  k' th  difference  of  (Z^}  is  given  by 

V‘"Z  ^  (I  -B)’"Z  =  I  (  ^)(-l)'^Z  . 

j=0  J  J 

Def ine 

k- 1 

=  >1  of  zero-crossings  by  V  Z^,  t  =  l,...,N. 


Then  D,  is  as  before  Ciie  number  of  zero-crossings  in  Z,,...,Z,, 
i  IN 

while  D,,  is  the  number  of  zero-crossings  in  VZ,,...,VZ^,  and  D. 

Z  i  N  3 

Z  ? 

is  the  number  of  ze  ro- crossi  ngs  in  V  Z  ,...,V  7.  and  so  on.  The 

J  N 

are  called  higher  order  crossings  (A31,  [44J.  Wlien  fZ^.}  is  first 

operated  on  by  a  filter  i  with  transfer  function  H  and  is  then  re¬ 


peatedly  differenced,  we  use  t  lie  notation 
crossings  in  the  filtered  series  f(Z  ). 


to  denote  liigher  order 


ini  iri  p^v^wvv^.v^r^v^  rrrwwT\’f  r»  g»  ■  v  ■^V^JI 
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Unlike  the  process  (Z^}  which  is  defined  over  all  the  integers, 
in  practice  we  only  have  finite  time  series  and  lose  an  observation 
with  each  difference.  To  avoid  end-effects  we  must  discard  the  be¬ 
ginning  of  the  series,  and  start  indexing  the  observations  or 
samples  by  moving  to  the  right.  Thus  if  it  is  desired  to  evaluate  k 
higher  order  crossings  the  time  index  t  =  l  is  given  to  the  k'th 

or  to  a  later  observation.  The  D.  must  be  evaluated  from  differ- 

J 

enced  records  of  the  same  length.  For  example,  suppose  the  given 
record  is 


16  1 

7  8 

9  2  3  0  7. 

Subtracting 

the  average  we 

have 

the  centered  series 

-3.  A 

1.6  -3.4  2.6 

3.6 

4.6  -2.4  -1.4  -4. 

4  2.6  . 

In  order  to 

evaluate,  say. 

Dp  1 

D^,  we  record  the  series  starting 

with  =  ■ 

-3.4  and  = 

2.6 

while  reserving  the  first  -3.4  and 

1.6  for  Z  ^ 

and  Z^  respectively 

.  We  can  now  evaluate 

Z  ,  VZ  ,  V^Z 

for  t  =  1 , . 

.  .  , 8  as 

Z: 

-3.4  2.6 

3.6 

4.6  -2.4  -1.4  -4. 4 

2.6 

VZ: 

-5.0  6.0 

1.0 

1 

o 

o 

1 

O 

7.0 

V^Z: 

-10.0  11.0  ■ 

-5.0 

o 

1 

oc 

s 

CO 

1 

o 

o 

10.  00 

Then  = 

3,  =  5, 

=  7 

and  arc  all  evaluated 

from  records 

of  length  8 

Observe  Lh.'it 

by  our  definition  a  shift 

from  a  negative 

value  to  0 

is  counted  as  a 

crossing.  Similar  remarks 

apply  to  other 

filters  as  well. 


Since  V  is  a  high-pass  filter  it  pushes  the  spectral  mass  to 


the  right  so  that  {Vz^}  becomes  more  oscillatory  than  {Z^}.  We 

2 

therefore  expect  that  D2  >  D^.  V  amplifies  high  frequencies  even 

2 

more  than  V  so  that  {V  Z^}  is  more  oscillatory  than  {VZ^}  and  we 
expect  that  >  D2,  and  so  on.  Also,  if  cj*,  say,  is  the  highest 

frequency  present  in  the  spectrum,  we  see  that  by  applying  to 
the  filter  V  and  letting  k  the  power  is  pushed  all  the  way 

to  cj*  and  renders  it  dominant!  Therefore  by  the  dominant  frequency 
principle  7tDj/(N-1)  should  converge  in  some  sense  to  u)*.  This 
heuristic  argument  is  made  precise  in  terms  of  the  sequence  of  ex¬ 
pected  higher  order  crossings  127],  [28]. 

Theorem  1.  Suppose  {Z^},  t =  . . . ,-l, 0, 1 , . . . ,  is  stationary  and 

Gaussian  with  mean  0  and  suppose  o)*  is  the  highest  frequency  in  the 

A 

spectrum,  w  £  tt.  Then 

(i)  0  <  <  ED^  <  ...  <  (N-1) 

TTED. 

(ii)  —3^  ->■  0)*,  j  0°. 

The  theorem  says  that  the  sequence  {ttEDj /(N-1) )  is  monotone  and 
bounded  and  therefore  it  converges  to  its  least  upper  bound  which 
is  0)*.  Under  fairly  general  conditions  we  can  also  show  [2  7]  that 

lim  lim  Var(D./N)  =  0. 

j  ->00 

It  follows  that  TT[)^/(N-1)  (without  "E")  approximates  a)*  for  suffi¬ 
ciently  large  j  and  N.  As  we  shall  sec  in  the  next  section  ttiis 
estimator  converges  as  j  increasi.’s  remarkably  fast  provided  U)*  is 
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sufficiently  removed  away  from  it.  This  discussion  explains  why  1)2 
is  in  general  a  better  choice  than  in  the  estimation  of  the 
highest  frequency  [25],  [26],  [27]. 

The  monotone  property  of  the  sequence  in  (i)  follows 

from  (1)  and  the  relation 

Pj  >  Py(l)  (7) 

where  Py(l)  is  the  first  correlation  in  [VZ^]  [28]-  In  general  the 
inequality 

D.^,  >  D.  -  1  (8) 

J  +  I  1 


holds  for  j  ^  Experience  shows  however  that  we  really  have 

D..,  >  D.  for  sufficiently  long  series;  e.g.  N  =  500  [43].  [44]. 

J+1  -  J 

Figure  4  gives  the  graphs  of  two  time  series  of  length  N  =  200 
from  a  stochastic  difference  equation 


^  *  “t 


(9) 


where  fu^}  are  uncorrelated  normal  random  variables  with  mean  zero 
and  =  ±0.5.  It  is  seen  that  even  for  these  relatively  short 

series  the  first  few  D,  tend  not  to  decrease.  (9)  satisfies  the 

J 

requirement  for  being  a  stationary  Gaussian  process  when  |(J)|  <1 
[45]. 

When  some  of  the  "toucli",  it  is  an  indication  of  a  strong 
periodic  component  in  the  data.  Consider  the  extreme  case  of  a 
sinusoid  with  a  certain  frequency.  Ihen  the  diflerences  are  again 
sinudoids  with  the  same  frequency  but  different  amplitudes,  and  by 
the  dominant  frequency  principle  all  the  ED.  are  the  same  provided 
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(a) 


Figure-  4.  Higher  order  erossings  from  (9),  N  =  200. 


they  are  obtained  from  differeni:ed  series  of  the  same  length.  For 
a  Gaussian  series,  we  can  show  that  the  converse  holds  as  well  [28] 
[291-  This  is  phrased  in  the  following  theorem  which  is  called  the 


Sinusoidal  Limit  iheorem  (Sl.T). 


Theorem  2  (SLT) .  Let  be  a  zero-mean  Gaussian  stationary 

process.  Assume  ED^  >  0,  and  suppose 


1  2 


Then  {Z^  }  is  a  pure  sinusoid  with  period  2(N-1)/ED^. 


The  surprising  fact  is  that  when  (10)  holds  alt  the  ED.  are  equal. 

A  more  general  Sinusoidal  Limit  Theorem  of  which  i'heorem  2  is  a 
special  case  is  due  to  Slutsky  [46]. 

Perhaps  the  most  general  statement  that  can  be  made  about 
higher  order  crossings  is  that  they  determine  uniquely  the  spectral 
distribution  function  F  up  to  a  constant.  To  see  that,  first  note 
that  the  higher  order  crossing  spectral  representation  is  given  by 


from  which  we  obtain  after  some  algebra  the  "long  formula"  [46], 


where  7  now  operates  on  and  (12)  provides  a  recursion  for 

obtaining  from  ED  ^  ,  ED,^ ,  .  .  .  .  For  example,  for  k  =  0 


(12)  reduces  to  (1)  so  that  is  deti-rmined  from  KD^  .  For  k  =  1 
(12)  g ives 


=  1  -  2(1  -  cos(^^))(l  +  cos(  — 
while  for  k  =  2  p_  is  determined  similarly  from  ED  ,  ED  ,  ED  . 

5  1  ^  i 

In  general  is  determined  from  (ED^ , .  .  .  ,  EDj^)  .  Now  recall  that  Pj^ 
is  the  k'th  Fourier  coefficient  of  F(uj)  =  F(o))/Yq,  and  so  the 
uniquely  determine  F.  But  from  (12)  the  sequence  is  uniquely 

determined  by  liie  sequence  of  higher  order  crossings  { ED^  } .  It 
follows  tliat  F  is  completely  determined  by  i  heve 

Theorem  3.  For  a  zero  mean  stationary  Gaussian  process,  the  sequence 
of  expected  higher  order  crossings  uniquely  determines  the 

normalized  spectral  distribution  function  F. 

The  above  discussion  can  be  summarized  by  the  symbolism 

fED.}  4=^  {p  }  4=^  F  (13) 

1 

which  is  another  indication  that  zero-crossings  of  filtered  series 

contain  useful  information  about  the  spectral  properties  of  the 

})roces.s.  (  13  J  may  be  viewed  as  a  ramification  of  the  Wiener- 

Khintchine  relation,  and  is  an  evidence  of  the  existence  of  a 

domain  to  which  we  refer  rather  loc'sely  as  the  D-domain.  The  D. 

1 

however  are  iiy  no  means  the  only  features  in  this  domain  as  is  evi¬ 
dent  from  the  next  section. 

3.4.  /.ero-cross  ings  of  Repeatedly  Summed  and  Differenced  Series 

file  counterparts  of  the  D.  are  zero-crossings  of  repeated  sum- 

) 

m.it  ion.  (Consider  the  summation  filter  (i+B)  whleli  gives 

(1+B)Z  =  Z  +  Z. 


Define 


=  if  of  zero-crossings  by  (1+B)^  t  = 

Then  >  3nd  again  we  refer  to  the  higher  order 

crossings.  VJhereas  the  D.  tend  to  increase,  the  .D  tend  to  de- 

J  J 

crease,  but  in  many  respects  these  two  types  of  higher  order 
crossings  are  quite  analogous.  Thus,  the  counterpart  of  Theorem  1 
is  the  following  fact. 

Theorem  4.  Suppose  {Z^},  t -... ,-1,0,1,. .. ,  is  stationary  and 
Gaussian  with  mean  0,  and  suppose  ^o)  >  0  is  the  lowest  frequency 
in  the  spectrum.  Then, 

(i)  E  >  E  2D  >  .  . .  >  0 

7TE  .D 

Evidently  Theorem  4  is  another  manifestation  of  the  dominant  fre¬ 
quency  principle,  and  provides  a  way  for  estimating  the  lowest  fre¬ 
quency.  Again  we  can  obtain  a  "long  formula"  as  in  (12)  in  terms 
of  {EjD},  by  replacing  in  (12)  all  the  minus  signs  by  plus  signs, 
which  establishes  the  equivalence  of  this  sequence  of  zero-crossings 
and  F. 

Tliere  is  no  reason  why  we  cannot  put  the  D.  and  .U  together 

.11 

to  produce  more  versatile  features.  A  more  general  definition  of 
higher  order  crossings  is  the  following.  Consider  the  filter  de¬ 
fined  by  m  differences  which  follow  n  sums. 


whose  squared  gain  is  given  by 


|G(cj)j^  =  -  cos  +  cos 


(15) 


Define  the  sequence  by 

k-1  i-1 

.D  =  //  of  zero-crossings  in  (1-B)  (1+B)'^  Z  ,  t  =  l,.,.,N. 

J  t 

Again  we  refer  to  the  .D  as  higher  order  crossings.  Evidently,  by 

J 

definition 


°1  "  1^  = 


because  in  these  cases  no  filtering  is  applied  to  {Z^}.  When  m,n 


are  large  and  m/n  =  c  then  ir  (N-1)  will  tend  to  admit  a  val 
in  the  neighborhood  of  A  where 


ue 


^  (lif  ) ■ 


provided  A^  is  a  point  in  the  spectrum.  This  is  so  since  |  G((jj)  |  " 
in  (15)  is  uniraodal  with  a  peak  occurring  at  A^  which  makes  this 
frequency  dominant. 

When  prior  to  the  application  of  (14)  the  series  is  operated 
on  by  a  linear  filter  with  transfer  function  H  we  shall  use  the 
notation 


j^Hk 


to  denote  the  higher  order  crossings  in  the  filtered  series.  If 


the  filter  is  just  the  summation  filter  1+B,  then  clearly 


4.  SPECTRAL  ANALYSIS  BY  ZERO-CROSSINGS 


Our  discourse  leads  us  naturally  to  the  important  problem  of 
spectrum  analysis.  The  relevance  of  higher  order  crossings  to  this 
problem  will  become  self-evident.  A  highlight  of  this  section  Is 
the  design  of  a  filter  used  in  the  delicate  problem  of  detecting  a 
very  weak  signal  from  higher  order  crossings. 


4.1.  Analysis  of  Discrete  Spectra  by  Zero-crossings 

We  are  now  finally  ready  to  apply  the  above  results  in  dis¬ 
crete  spectrum  estimation.  For  this  purpose  consider  the  process 


Z 


t 


(A. cos  oi.t  +  B.sin  cj.t 
J  J  J  J 


), 


(16) 


t=... ,-1,0,1,...,  where  the  a>.  and  p  are  constants  and  { A , } ,  {B  , } 

J  J  J 

are  taken  as  uncorrelated  normal  random  variables  such  that 


and 


EA.  =  EB.  *  0 
J  J 


for  all  j 


EA.A, 

1  j 


EB^Bj  =  0, 
2 

=  o. 


i  ^  j 
i  =  j 


EA.B.  =  0 
1  J 


for  all  i,j. 


Further,  without  loss  of  generality  we  assume  that 

0  <  ui,  <  (i).  <  .  .  .  <  (i)  <  TT.  It  follows  [32]  that  {Z  }  is  a  sta- 
i  Z  p  ~  t 

t ionary  Gaussian  process  with  mean  zero.  Its  spectral  distribution 

2 

function  is  a  step  function  witli  jumps  of  magnitude  of  size  'iOj  at 
iiiip  j=l,...,p.  In  order  to  rea lize  Z^,  t  =  l,...,N,  it  is  useful 
to  think  of  the  {A.},  fB.}  as  first  being  determined  by  some  random 


mechanism  so  that  they  become  just  constants  throughout  each  reali 
zation.  This  resolves  the  ambiguity  mentioned  earlier,  for  the 
constant  values  there  may  be  thought  of  as  being  the  constant 
values  of  random  variables.  The  point  being  made  is  that  what  we 
aatuallij  observe  is  a  trigonometric  polynomial  of  the  form  (16) 
with  some  unknown  coefficients.  Processes  of  the  form  (16)  are 
ideal  for  an  illustration  of  all  the  foregoing  discussion. 

The  spectral  representation  of  the  ED^  (11)  reduces  to 


\  p 

(  )  =  I  W^(j  ,w)cos(a3^) , 

\  ‘  /  r=  1 


where 


W^(j.,^) 


^  r  sin  CO,  /2  “1 


?  /  !i  \ 

\  o  )  sin  CO 
1=1  '  r  '  r 


An  interesting  representation  analogous  to  (17)  has  been  obtained 


in  [Al].  It  is  seen  that  W  (j  ,cj)  >  0  and  W  ( 

r=l  ’’ 


I  W  (j, 


co)  =  1.  Als( 


since  the  co^^  are  ordered  in  (0,tt]  and  since  sin(x/2)  is  monotone 
there  we  have 


W^(j ,w) 


0,  r  ^  p 


1 ,  r  =  p 


[’here fore  as  j 


COs((jO  )  , 

p 


or,  since  cos(x)  is  monotone  in  [0,tt], 


and  we  have  demonstrated  (ii)  of  Theorem  1  in  the  purely  discrete 
spectrum  case.  That  is,  the  normalized  ED^  converge  to  the  highest 
angular  frequency  Similarly,  directly  or  from  (il)  of  Theorem  4 

ttE  ,D 


Now  (2)  reduces  by  (17)  to 


(ttEDj^  \  a^cos  +  . . 


.  +  O  cos  (U 
_ 2 _ 2 

P 


Suppose  0^  -*•  ®.  That  is,  suppose  becomes  dominant  as  expressed 
by  associating  more  power  with  the  amplitudes  A^,  Then  (20) 

obviously  converges  to  cos(w^)  and  we  have  by  mono  tonicity 


in  accordance  with  the  dominant  frequency  principle.  Another  useful 
fact  due  to  (17)  is  that 


tied  ttED 
*^1  -  - 


.  <  U) 


Observe  that  ui,  =  u)  if  and  only  if  7rED,/(N-l)  =  ttED  /(N-1)  by 
ip  1  z 

the  SLT. 

(18),  or  equivalently  (19),  provides  a  readily  available  method 
for  obtaining  all  tlie  w' s  and  p  as  shown  in  the  tollowing  algorithm. 
We  assume  tfiat  N  is  sufficiently  large  so  that  EIJ./(N-1)  in  (18)  can 
be  replaced  by  D./(N-1)  for  all  practical  purposes. 


Algorithm  1.  Computation  of  OJ  and  p  given 

—  -  Ip  IN 


<0)  <  IT. 

P  " 


from  the  harmonic  process  (16).  Assume  0  <  <  u)2 

Step  0:  Let  p*  =  0. 

Step  1:  Determine  (jJ  from  lim  itD./(N-1). 

P  •  j 

If  ojp  =  0,  go  to  Step  4. 

If  (i)  ^  0,  let  p*  =  p*  +  1.  Print  co  . 

P  K  p 

Step  2:  Filter  out  with  an  ideal  low-pass  filter  with 
cutoff  frequency  tj~. 

Step  3:  Replace  p  by  p  -  1-  Go  to  Step  1  with  the  filtered 
series  obtained  in  Step  2. 


Step  4:  Let  p 


Print  p.  End  computation. 


When  p  is  known  apriori  the  algorithm  can  be  easily  amended  to 
stop  after  the  detection  of  p  frequencies.  Also,  in  practice  we  do 
not  have  ideal  filters  and  must  be  content  with  approximations  [33]. 
[35]. 

To  demonstrate  the  essence  of  the  algorithm  consider  the  sim¬ 
pler  case  when  p  is  known.  Suppose  (16)  is  of  the  form 

Z^  =  cos(0.8t)  -  1. 3cos(l. 25t)  +  sin(2t),  t=l,...,200,  ( 

where  p  =  3.  This  is  a  fairly  short  series  and  it  is  Interesting 
to  see  what  the  algorithm  produces  in  this  case.  Instead  of  a 
close  to  ideal  low-pass  filter  we  use  the  much  cruder  low-pass  fil- 
ter  (1+B)  which  attenuates  high  frequency  and  which  suffices  for 
this  demonstration.  The  results  with  only  eight  higher  order 
crossings  are  given  in  Table  I  and  are  quite  satisfactory  given  tlie 


length  of  the  data. 


The  column  corresponding  to 


D,  starts  too 


1 

77 

50 

47 

2 

81 

61 

48 

3 

89 

79 

48 

4 

108 

79 

48 

5 

126 

80 

49 

6 

126 

80 

49 

7 

127 

80 

49 

8 

127 

79 

50 

Estimated 

highest 

oj  =  2.005 

=  1.247 

u)  =  0. 

frequency 

lini 


3 


I 


low  due  to  an  end  effect  produced  by  the  shortness  of  the  original 
series  which  was  220.  For  another  example  using  the  slne-Butterworth 
low-pass  filter  see  [40]. 


4.2.  Mixed  Spectrum  Analysis  by  Zero-crossings 

Consider  again  the  harmonic  process  (16),  but  suppose  we  now 
add  to  it  a  noise  component  to  form  a  signal  plus  noise  process. 


^t  = 


j  =  l 


(A. cos  u.t  +  B.sln  w.t)  +  e  , 
J  J  J  J  t 


(24) 


where  the  A.,  B.,  w,  are  as  in  (16)  and  {Ej.}  is  white  noise  made  of 

2 

uncorrelated  normal  random  variables  with  mean  0  and  variance  O  . 

e 

1 1  is  assumed  that  is  independent  of  the  Then  {2^.1 

is  again  a  stationary  Gaussian  process  with  zero  mean  since  it  is 
the  sum  of  two  independent  Gaussian  components,  the  signal  and  the 
noise.  As  before  the  problem  is  to  determine  the  0)^  and  p.  This 
time  however,  the  problem  is  more  involved  since  (Gj.)  adds  to  the 
process  a  continuous  spectrum  whose  highest  frequency  is  it.  There¬ 
fore  by  Theorem  1,  TrDj/(N-l)  will  converge  to  tt  and  not  to  co^.  It 
is  true  that  0)^  is  still  the  highest  discrete  frequency,  but  this 
is  of  no  great  help  now.  It  follows  that  the  technique  of  the  pre¬ 
vious  section  is  not  in  general  suitable  for  the  mixed  spectrum 


case . 

Luckily  enough,  the  dominant  frequency  principle  prevails  in 
this  case  too  but  in  a  rather  subtle  way  which  requires  an  explana- 
t  ion . 

The  zero-crossing  spectral  representation  (2)  for  the  signaJ 
plus  noise  process  (24)  reduces  to 


a. cos  w.  +  +  a  cos  oj 


from  which  we  can  see  directly  that  when  becomes  dominant  (by 
2  2  2 

rendering  o  larger  than  a  and  a.,  j  ^  r)  it  can  be  estimated  by 
r  ^  J 

ttDj^/(N-1).  Evidently,  our  problem  is  solved  if  we  can  render  the 

Li's  dominant  in  succession  starting  with 

Let  us  envision  an  ideal  situation  where  becomes  dominant 

due  to  a  low-pass  filter  which  pushes  the  spectral  mass  to  the  left. 

Then  ttd  ,/(N-l)  will  detect  it.  By  applying  the  high-pass  differ- 
H  i 

ence  filter  V  the  power  shifts  to  the  right  and  suppose  0)2  now  be¬ 
comes  dominant  instead.  Then  will  detect  it.  Ideally, 

we  apply  V  again,  push  the  power  even  further  to  the  right,  render 
dominant  and  detect  it  by  (N-1) .  This  iterative  process 

can  be  repeated  until  all  the  w's  are  detected  by  the  .  This 

idealization  cannot  of  course  be  expected  to  hold  exactly,  but  the 
degree  to  which  it  does  hold  is  surprising.  The  fact  observed  in 
numerous  cases  is  that  the  normalized  higher  order  crossings 

rD  /(N-1)  tend  to  "visit"  the  oj's  on  their  way  to  the  highest  fre- 
nj 

quency  as  j  increases.  The  same  holds  true  for  tt  (N-1)  by 
symmetry,  except  that  now  the  sequence  tends  to  "visit"  the  w's  on 
its  way  to  the  lowest  frequency  as  k  increases.  This  fidelity  of 
higher  order  crossings  has  been  first  observed  in  [27). 


To  demonstrate  the  curious  tendency  of  normalized  D  .  to  either 

nj 

land  on  or  pass  very  near  the  u)^ ,  consider  again  the  process  In  (6) 
with  additive  random  noise. 


t  =  1, . . . ,200, 


(2 


33 


=  0.8cos(0.8t)  +  cos(1.25t)  +  e^. 


Var(E^)  =  1/12.  For  a  low-pass  filter  we  use  (1+B)  so  that 

^  lO^j  ‘  Tsble  2  we  see  that  without  prefiltering  the  se¬ 

quence  {110^/199}  starts  off  near  the  dominant  frequency  1.25  and 
then  increases  toward  it.  Consequently  no  information  is  available 
about  the  smaller  frequency  0.8.  But  by  applying  the  low-pass 
filter,  the  sequence  {Tr,^D./199}  starts  off  near  0.8  and  then  as 

10  j 

it  increases  it  also  visits  the  second  1.25.  Thus  both  frequencies 
were  visited.  For  more  examples  see  [27]. 


j 

No 

D. 

J 

filtering 

TTD./199 

J 

Application 

lo'-j. 

of  (1+B)’ 
UloD/199 

1 

78 

1.231 

51 

0.805  (*) 

2 

79 

1.247  (*) 

51 

0.805  (*) 

3 

95 

1.500 

66 

1.042 

4 

110 

1.  737 

78 

1.231 

5 

130 

2.052 

79 

1.247  (*) 

6 

144 

2.273 

79 

1.247  (*) 

7 

154 

2.431 

80 

1.263 

8 

160 

2.526 

80 

1.263 

9 

164 

2.589 

79 

1.247  (*) 

10 

166 

2. 621 

82 

1.295 

Table  2.  Higher  order  crossings  from  (26)  visit  and  A 

visit  is  indicated  by  (*). 


Original  series  (26).  N  =  200. 


Figure  6.  The  scaled  series  (26)  before  and  afier  filtering 
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4.3.  Detection  of  a  Weak  Signal 

!  The  fact  that  the  sequence  {tt  D  /(N-1)}  tends  to  contain 

I  k  Hj 

values  which  are  attracted  to  discrete  frequencies  in  the  spectrum, 
provides  a  powerful  tool  in  detection.  This  will  now  be  illus- 

I 

I 

I  trated  by  applying  the  foregoing  ideas  in  the  estimation  of  a 

single  frequency  of  a  sinusoid  buried  in  random  noise.  In  this 
connection  we  make  use  of  the  periodogram  evaluated  at  higher  order 


(■  rossings , 


N 

I  7  exp(- 
t=l 


in  D  . 
k  H.1 

N-1 


j  =  1,2 . 


(27) 


and  i  =  /-I.  When  a  certain  n ,  D,, ./ (N-1)  is  close  to  a  discrete 

k  Hj 

frequency,  the  periodogram  will  in  general  be  inflated.  In  this 
way  we  can  tell  which  of  the  normalized  higher  order  crossings 
^  landed  at  or  near  a  discrete  frequency. 

Periodogram  analysis  is  an  important  tool  in  time  series 
analysis  [47 |,  [48].  Usually  the  periodogram  is  evaluated 

at  points  of  tlie  form  2“k/N,  k  =  0 ,...,( N/ 2  j  ,  and  the  idea  is  to 
ol)servo  significantly  larv  p,  •'ictdogram  ordinates.  A  point  2vk/N 
wliich  gives  rise  to  an  inf  i..  ted  periodogram  ordinate  indicates 
that  the  data  conttiin  a  periodic  component  with  period  N/k.  In 
our  use  of  (27),  the  periodogram  is  evaluateu  at  on  1  v  a  points 

which  are  not  necesstirilv  o)  tlie  form  2”k/N. 

Consider  the  proi’ivss  (24)  with  p  =  1, 

Z  =  A,  cos  w,t  +  B.sin  (.i  t  +  t  ,  (28) 

t  1  I  1  1  t 

I 

I 


t  and  again  suppose  we  want  to  estimate  from  zero- 

crossings.  This  problem  was  considered  in  [  1  ],  [3],  [39],  [41], 
[42],  [49],  [50]-  It  is  actually  possible  [49]  to  solve  for  in 
terms  of  and  but  it  was  shown  in  [50]  that  the  solution  may 
be  extremely  sensitive  to  small  changes  in  these  higher  order 
crossings.  The  incorporation  of  certain  filters  gives  more  robust 
es  t  ima  tes . 

The  first  order  low-pass  filter  is  defined  by  the  recursion 

L(Z^)  =  (l-a)Z|.  +  aL(Z^_p,  (2 

0  <  a  <  1.  Its  squared  gain  is  given  by  [33] 

- . 

1  -  2acos  10  +  a 

The  combination  of  this  filter  and  repeated  summati(m  was  found 
very  useful  in  higlier  order  crossing  analysis  owing  much  to  the 
following  fact  [50]. 

Theorem  5_^  Let  be  given  by  (28)  and  without  h)ss  of  generality 

assume  that  0  <  <  tt/Z.  Then  regardless  ot  the  s  i  gna  1  -  to-no  ise 

ratio,  a , /a  ,  we  have 
1  e 

(a)  For  I  sufficiently  close  to  0 

"’•■^ci  -1 

with  equality  holding  wlien  rt  =  cos((,ij). 

(b)  For  't  sufficiently  close  to  1  I  lie  inequalities  are  reverseti. 


3 


Theorem  5  tells  us  that  as  a  increases  away  from  0  toward  1, 

ttD  /(N-1)  has  a  good  chance  for  "visiting"  U)  .  Our  strategy  in 

light  of  the  foregoing  discussion  is  to  increase  a  in  small  steps 

and  then  evaluate  (27)  at  each  step.  When  a  visit  occurs,  the 

periodogram  will  be  inflated.  Since  the  theorem  holds  regardless 

of  the  signal- to-noise  ratio,  this  procedure  gives  hope  in 

detecting  even  very  weak  signals.  For  small  tt  is  perhaps 

better  to  prefilter  with  (1+B)^  for  some  k,  e.g.  k  =  2. 

As  an  illustration  consider  a  computer  simulation  of  (28)  with 

=  0.75,  ~  -20dB,  N  =  2000.  For  each  a-value, 

a  =  1/30, ..., 10/30,  we  obtained  the  zero-crossings 

normalized  D  and  the  corresponding  relative  (out  of  10  cases) 

J  01 

periodogram  ordinates  are  given  in  Table  3.  The  highest  relative 
periodogram  ordinate  corresponds  of  course  to  0.75  as  it  should. 

The  more  startling  fact  though  is  that  the  normalized  corre¬ 

sponding  to  a  =  0.1  landed  exactly  on  co^. 


a 

TT  ,^0^^/1999 

Relative  periodogram 
ordinate 

1/30 

0.  791 

0.121 

2/30 

0.  765 

0.049 

3/30 

0.  750  (*) 

0.  583 

4/30 

0.  724 

0.  no 

5/30 

0.  709 

0.  008 

6/30 

0.  69  3 

0.  025 

7/30 

0.684 

0.026 

8/30 

0.655 

0.  030 

9/30 

0,  640 

0.  002 

10/30 

0.  62  7 

0.  046 

1 . 000 

3.  Relative 

l)e  r  i  t.'dog  ram  ord  ina  t  c; 

evaluated  at  ti'ii  Ti  j/1999 

from  (28) 

.  a  /',  =  -20dB,  .. 

=  0.75.  (*)  indicates  a 

visit. 
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4.4.  Search  for  Periodicities 

As  a  final  example  of  mixed  spectrum  analysis  we  consider 
briefly  a  geophysical  time  series  whose  graph  is  given  in  Figure  8. 
We  do  not  attempt  complete  spectrum  analysis  but  only  further  illus¬ 
trate  the  interesting  interplay  between  zero-crossings  and  linear 
filtering  in  a  case  of  real  data.  In  this  example  the  series  con¬ 
sists  of  600  observations  sampled  every  10  minutes.  Tlu’  search  for 
periodicities  is  conducted  mostly  by  low-pass  filtering  followed  by 
extraction  of  higher  order  crossings  [40].  We  confine  ourselves  to 
low-pass  filters  which  are  combination:  of  repeated  summation,  the 
first  order  low-pass  filter  (2^1)  and  repeated  application  of  the 
sine  Butterworth  filter  with  squared  gain  [35] 


|Q(w)|^ 


1 

L 

1  +  1 

( 

'i  0)  ' 

-1-2 

^  sin 

)  J 

(3C 


where  w.  is  an  ideal  cutoff  point  and  1,  is  the  number  of  repetitions 
b 

of  the  filter.  For  each  combination  of  filters  we  extracted  30 
higher  order  crossings  and  the  periodogram  (27)  was  evaluated  for 
each  of  the  30  cases. 

1  2 

Starting  with  the  filter  (1-B)(1+B)  followed  by  (29),  with 
a  =  0.25,  the  relative  periodogram  ordinate  (27)  gave  about  907, 
weight  to  TT  ^  n  =  0.1049.  This  corresponds  to  a  period  of 

2ti/(6  X  0.1049)  =  9.983  hours. 

For  the  same  combination  of  filters  but  with  a  =  0.6,  the 
relative  periodogram  ordinate  gave  m-arlv  100,'  weigiiL  to 
I!  I)  /(N-1)  =  0.0839.  This  corresponds  t('  a  period  ol  12.481  hours 


To  detect  even  lower  significant  frequencies  (or  higher 


periods)  we  used  (30)  with  u)  =  0.084,  L  =  2  together  with 

b 

(1-B)(1+B)^^  and  (29)  with  Ot  =  0.1.  The  relative  periodogram 
ordinate  gave  about  50%  weight  to  /(N-1)  =  0.0427  which 

corresponds  to  a  period  of  24.525  hours. 

Therefore  the  data  contain  periodic  components  with  periods 
of  about  10,  12.5  and  24.5  hours.  For  a  discussion  of  signifi¬ 
cance  tests  using  periodogram  ordinates  in  conjunction  with  highe 
order  crossings,  see  [27]. 


0.0 


160.0 


320.0 


480.0 


4. 5.  A  Continuous  Spectrum  Case 


An  important  class  of  stationary  stochastic  processes  is  the 
one  characterized  by  stochastic  difference  equations  of  the  form 


Z 

t 


■  Vt-i  + 


+  <p  Z 
P  t-p 


+  c  . 
t 


( 


When  is  white  Gaussian  noise  and  the  roots  of 

1  -  0.x  -  ...  -  0  =  0  lie  outside  tlie  unit  circle,  (z  }  is 

Ip  t 

stationary  and  Gaussian  with  mean  zero  [45].  (31)  provides  a  use¬ 

ful  model  for  many  applications  [45],  [52],  and  is  referred  to  as 
an  autoregressive  process  of  order  p.  This  process  has  continuous 
spectrum  (only!)  with  spectral  density  [32] 


f  (ui) 


2 

a 

K 

2Tr 


-iw 


-ia.'p,2  ’ 


-n  <  (1)  <  TT.  ( 


Except  for  the  constant  term  (7^/2Tt,  f (cj)  is  determined  by  higher 
order  crossings  in  agreement  with  Theorem  3.  To  see  that,  define 


Then,  the  so-called  Yule-Walker  equations  imply  [45]  that 


0  = 


1 

P 


c 


From  (12)  we  know  that  are  uniquely  determined  by 

El),  ,...,FD  ,  and  so  0,,...,0  by  (33)  arc  comijletely  determined 
Ip  1  P 

from  expected  higher  order  crossings.  This  and  the  fact  that 


imply  that  the  normalized  spectral  density  (f(w)  ^  f(u))/YQ) 


1-0 

f(w)  =  - 

2Tr  1 1  - 

is  completely  determined  by 
obtain  fast  estimates  of  (34) 


1^1 


_ E_E_ 


-itj 


.-(be 

P 


•  iu)p  I  2 


ED-,..., ED  .  It  follows  that  we  can 

1  p 

by  solving  for  p  and  ({)  in  terms  of 


the  observed  D_,...,D  . 

1  P 

As  an  example  consider  the  first  order  case  of  p  =  1.  Then 
(34)  becomes 


(3- 


f(0J) 


2 

2''t(1  -  2(t>^cos  to  + 


(3! 


But  (})^  can  be  estimated  by  [36] 


(3f 


Thus  f (oi)  can  he  estimated  very  fast  from  only. 

Similarly,  in  the  seri->nd  orJer  case  a  fast  spectral  estimate 
is  obtained  from  D^,  I).,  by  substituting  these  two  higjier  crossings 
for  ED^,  ED^  in  (12).  From  this  we  obtain  in  succivssion  estimates 
for  p^,  (>2  and  then  estimates  for  if  j  ,  ^  from  (31),  and  finally  an 
estimate  for  ffio)  liy  substitution  in  (34). 

This  type  ot  speetr.'il  estim.ate  is  reminiscent  ot  .mt  ot  egi  fss  i  vc 


and  maximum  entrofty  spectral  estimates  (52li  I  331.  hut  has  mU 


5.  SIGNAL  DISCRIMINATION  BY  ZERO-CROSSINGS 


In  addition  to  the  fact  that  higher  order  crossings  contain 
useful  spectral  information,  they  also  serve  as  a  simple  and 
effective  means  for  achieving  drastic  data  reduction  important  for 
discrimination  and  classification  purposes.  In  what  follows,  we 
discuss  various  ways  for  using  the  higher  order  crossings  in 

discrimination  between  time  series  by  appealing  to  their  monotone 
property  and  the  rate  at  which  they  increase.  It  will  become 
apparent  that  in  general  very  few  D.  are  needed  as  their  discrimi¬ 
nation  potency  decreases  rather  rapidly  as  j  increases. 

Terms  such  as  discrimination,  classification,  goodness  of  fit 
and  signature  analysis  are  sometimes  used  somewhat  loosely  to  mean 
basically  the  same  thing.  By  discrimination  between  time  series  we 
shall  mean  the  process  of  determining  from  the  the  degree  of 
similarity  between  two  time  series,  or  between  a  given  time  series 
and  an  hypothesized  one.  For  example,  given  the  D,  of  an  engine 
signature,  the  discrimination  problem  is  to  determine  how  close 
these  Dj  are  to  the  tiigher  order  crossings  of  a  reliable  engine. 

Throughout  this  section  it  is  assumed  that  the  highest  fre¬ 
quency  is  IT  as,  for  example,  is  always  the  case  in  the  i)resence  of 
white  noise. 


5.1.  The  Higher  Order  Grossings  iTx'orem 


To  quantity  in  some  sense  the  amount  of  into  mat  ion  in  the 


i 


pertaining  to  discrimination,  it  is  convenient  to  introduce  a  binary 

(  i) 


[)rocess  iX^’'  '  wliich  we  havi-  been  using  implicitly  all  along.  De- 


G. 


inf.  i  ^  1 


fine  hv  clipping  fv' 


otherwise . 


Clearly  -  X^,  and  is  precisely  the  number  of  symbol 

changes  in  X^'^  \  ,  X^'^  ^ .  Of  interest  is  the  behavior  of 

large  j. 

Because  it  is  assumed  that  it  is  included  in  the  spectral  support, 
(ii)  of  Theorem  1  tells  us  that 


1, 


(37) 


This  and  the  SLT  imply  that  in  (i)  of  Theorem  1  the  inequalities  are 
strict , 

ED^  <  ED^  <  . . .  ,  (38) 

since  otherv>fise  if  Elh  =  +  l  some  j  we  immediately  have  a 

sinusoid  and  ED,/(N-1)  will  converge  to  the  frequency  of  this  sinu¬ 
soid  divided  liy  ti ,  which  contradicts  (37).  Now,  replacing  ED^  by 
D.,  we  expect  from  (37),  (38)  that  as  j  increases  {X^^M  approaches 
a  limiting  state  of  alternating  symbols: 

...010101...  .  (39) 

This  degenerate  state  is  independe\>t  ol  the  initi.il  process  i7.^!  as 
long  .as  M  is  in  tin*  S))ectruni.  This  is  sL.ated  form.il  Iv  in  the  Higher 


Order  Crossings  i'fieoreni  (ifOCT)  (44  i- 


Theorem  6  (HOCT) .  Let  {Z^},  t  = . . . , -1 , 0, 1 , . . . ,  be  a  stationary 
Gaussian  process  with  mean  zero,  and  assume  that  a  is  included  in 

the  spectral  support.  Then, 

...  r  ...010101...,  with  probability  1/2 

(i)  < 

[_  .  .  .  lOIOlO.  .  .  ,  with  probability  1/2, 

as  j 

(ii)  lim  ] im  D./(N-1)  =  1  with  probability  1. 

The  theorem  is  demonstrated  in  two  special  cases  in  Figure  9  where 

we  see  that  the  degeneracy  (39)  is  achieved  rather  fast.  The 

binary  arrays  should  be  thouglit  of  as  sections  from  infinite 

binary  arrays  while  the  rightmost  column  gives  0^,...,!)^^^  from 

series  of  length  X  =  1000.  The  process  from  which  these  data 

were  taken  is  (9)  with  (p  =  0.8,  0.5. 

From  the  above  <.liscussion  we  see  that  discrimination  based 

on  D.  for  large  j  is  a  fruitless  effort  since  different  processes 

yield  almost  the  same  Moreover,  experience  shows  that  even  for 

moderate  i  the  D.  are  not  very  helpful.  In  Figure  9,  for  example, 

later  1).  are  relatively  metre  similar  than  earlier  ones  for  two 
I 

different  processes.  In  other  words,  the  initial  rate  of  conver¬ 
gence  tL'ward  the  ileg.enera  t  e  state  (39)  is  quite  fast.  ibis  initial  rate 
however  provides  a  very  usetul  discrimin.ition  measure  and  conset|uent  ly 
only  very  few  I),  are  nei'ded  for  r'llertive  tl  i  sc  r  i  m  i  na  t  i  on  143],  (AAl, 
[54|.  Figure  10  gives  the  !) .  ,  i  =  1,...,10,  again  fritm  a  first 
order  aut  o  nq;  ress  i  ve  process  (9)  cor  la'spoiu!  i  n;',  to  dilteianit  values 
of  till'  paranietL-r  ,  whicn  N  =  lOtJf).  The  rate  ot  incvisise  in  the  1). 


1 00 1 00000 1 1 1 1 1 1 

207 

00 1 1 00 1 1 0 1 1 1 1 1 1 

513 

00 1 1 00 1 1 0 1 0 1 1 0 1 

659 

01 1001 lOOIOI 101 

715 

0 1 1 00 1 00 1 1 0 1 00 1 

745 

OlOOl 1001 lOIOOl 

773 

OlOOl 101 1 lOIOI 1 

807 

0101 1001 lOOIOl 1 

823 

0101 lOOIOOOlOlO 

829 

OlOIOOl lOIOlOlO 

849 

OlOIOOIIOIOIOlO 

855 

OlOIOI 1 lOIOIOlO 

865 

OlOIOIOOOIOIOlO 

875 

OlOIOIOIOIOIOlO 

883 

OlOIOlOlOlOIOlO 

885 

OlOIOIOIOIOIOlO 

893 

<p-0.8 

(a) 

_9.  Ofmons  t  ra  t  i  on  ijf  tho 


0111 0000 1  I  I  I  I  I  I 
0  I  1 000 1  I  1 0 1 00 1 0 
0  1 000 1  I  I  1 0  1 00 1  0 
0 1 000 1  1 0 1 0 1 0 1  10 
OlOIOIOOIOIOIIO 
OlOIOIOI lOIOlOO 
OlOIOIOI lOIOlOO 
OlOlOIOIOOIOIOI 
OlOIOIOI 00 lOIOI 
OlOIOIOlOl lOIOI 
OlOIOlOIOI lOIOI 
OlOIOIOlOl 00 101 
OlOIOIOlOIOOIOI 
OlOIOIOlOIOI 101 
OlOIOIOIOIOI 101 
OlOIOIOIOIOIOOl 

<^=0.5 

(b) 


i"fK‘r  Order  Oressings  Theorem 


using  a  first  ordor  autoregrf'ssi  ve  process.  W  = 


lono. 


Figure  10.  Plots  of  from  a  first  order  autore¬ 

gressive  process  with  parameter  (}),  N  =  1000. 

(compare  with  (38))  differs  initially  from  process  to  process,  but 
as  j  continues  to  increase  the  differences  taper  off. 

Another  exam[)le  is  provided  by  speech  data  obtained  from  the 
utterances  of  "zero",  "one",  and  "two".  See  Figure  11  for  a  typical 
appearance  of  such  data.  The  sampling  rate  was  8000  samples  per 
second  and  we  used  2000  samples  in  obtaining  in  each 

of  the  three  cases.  The  D,  in  each  of  the  thri-e  cases  are  plotted 
in  Figure  12  ;ind  are  seen  to  e.xhibit  different  initial  rates  of  in¬ 
crease  corresponding  to  the  various  cases.  The  figure  also  suggests 
that  successful  discrimination  can  be  achieved  by  fewer  than  twenty 
higher  order  crossings  since  from  1)^^  on  the  rate  of  increase  in 
the  three  cases  is  very  similar.  In  many  cases  in  [jractice  eight 


or  even  as  tew  as  six  1).  sulfici'  f()r  I'ffectivi'  discrimination. 


It  is  interesting  to  note  that  the  apparent  diminishing  utility 


of  higher  order  crossings  for  discrimination  purposes  is  essentially 
a  graphical  fact.  When  observing  the  graph  of  a  time  series,  the 
most  conspicuous  fc‘atures  are  zero-crossings,  peaks  and  troughs, 
and  inflection  points.  But  these  three  features  correspond  to 
Dj ,  D^,  and  the  inability  of  humans  to  discern  much  beyond  these 
features  parallels  the  fact  that  the  D.  for  large  j  have  virtually 
no  discrimination  potency. 


5.2.  An  Application  to  White  Woise  Testing 

To  measure  the  degree  of  similarity  between  time  series,  we 
can  use  plots  such  as  those  in  Figures  10  and  12.  By  tliemselves, 
though,  the  plots  are  not  sufficient  since  they  do  not  provide  a 
clear-cut  decision  rule.  To  circumvent  this  problem  we  can  obtain 


probability  limits  wlucdi  contain  a  given  plot  with  known  proba¬ 


bility.  In  a  wide  range  of  applications  this  goal  can  be  achieved 
if  the  variance  of  !) .  is  known.  Although  simple  closed  form  e.\- 


pressions  for  Var(lJ.)  do  luU  exist,  various  approximations  are 


.) 


available  [54],  [55  I-  A  very  useful  and  simple  approximation  can 


be  obtained  if  0,  as  k  *■  sufliciently  fast.  Define 


>5'^'  A  Pr(xJ'^^  =  l 


i  hen  it  was  shown  in  |55l  that  when  .  ,  =  0,  k 

K 


lor  some  posi¬ 


tive  int>.'gi'r  n,  we  have 


Var  I), 


I  im 


(40) 


Thus,  for  sufficiently  large  k 


Var(D|.)  -  (N-1)aJ*'\i-aJ’'^.  (41) 

It  has  been  observed  however  that  (41)  provides  a  very  good  approxi¬ 
mation  for  Var(Dj^)  also  for  small  k  in  the  white  noise  case. 

Under  the  hypothesis  of  white  noise  has  an  asymptotic  normal 
distribution  and  also  we  know  F.D|^  exactly.  Thus  we  can  use  the  ap¬ 
proximation  (41)  in  ■  erming  probability  limits  for  Approximate 

95X  probability  limits  for  D  are  given  by  [55] 

k. 

(N-l)[|  +  ^  sin“^-j^S)  ±  i.96{(N-l)[i  -  (^  sin'^*^ ) )  ^  1  (42) 


The  hypothesis  of  white  noise  is  rejected  if  c) t  least  one  , 
i=I,...,K,  falls  outside  the  limits  (41).  When  all  the  Dj , 
i  fall  inside  the  limits  (42),  the  initial  rate  of  in¬ 

crease  in  the  U.  resembles  that  of  white  noise  and  tlie  hypothesis 
J 

of  white  noise  is  accepted.  For  statistical  considerations  re¬ 
garding  this  test  see  [54],  [55  |. 

As  an  illustration  of  this  graphical  test,  consider  the  geo¬ 
physical  series  in  Figure  8.  The  first  ten  I).  together  with  the 
limits  (42)  are  plotted  in  Fijpire  11.  Since  the  lirst  two  lie 
outside  the  limits,  tin*  scries  is  not  whiti'  noise  as  is  well  ex- 
pec'  ted. 


As  another  example  c<)nsider  the  series  is  Figure  14.  For  tliis 
case,  tlu'  path  ot'  liiglier  order  crossiiu'.s  is  completi'ly  inside  the 
limits  ('(2)  and  so  tlu*  hypollK'sis  ol  v,;h  i  1 1'  noise  is  accepted. 


WHITE  NOISE  LIMITS 


1111 _ I _ I _ I _ L 

2  4  6  8 


iigs  plot  (broken  line) 


5.3.  A  Graphical  Similarity  Measure 


We  have  used  above  the  rate  of  increase  in  the  first  few  higher 
order  crossings  as  a  basis  for  a  white  noise  test.  The  same  pro¬ 
cedure  can  be  used  in  testing  the  liypothesis  that  a  given  time 
series  follows  a  specific  model;  e.g.  (31)  with  given  parameters. 
When  the  parameters  are  specified,  under  the  hypothesis,  KDj^  and 

can  he  evaluated  exactly  from  (1)  as  follows.  Define  tti 

be  the  first  autocorrelation  of  {V 

E  corr(v‘'Z|.,v‘"z^_^), 

where  now  ^  P2'  Observe  that  P^*^^  is  given  by  (12).  (1)  im¬ 

mediately  yields 


and 


/M  ,x/i  1  •  -1 

ED|^  =  (N-1)(-2  -  --  sin  ) 


,  (10 

^1 


-1 

sin  p 


(k-1) 

1 


(A3 


(44 


Experience  shows  that  substitution  of  (44)  in  (41)  still  provides 
a  reasonable  fast  approximation  to  Var(I)j^),  and  so  approximate  95/ 
probability  limits  for  are  given  by 

u 

ED,  ±  I  .  96fVar  D,  }  0  (  4  5 

k  k 

To  obtain  (45)  we  follow  the  following  steps. 

1.  Specify  a  model  for  the  time  series  by  a  hypothesis. 

2.  Obtain  the  first  few  autocorre  la  t  ions  for  the  hy|>o  thes  ixed  model 

3.  Obtain 


r  ri)m  ( !  2 ) . 


55 


4.  Evaluate  (43),  (44). 

5.  Obtain  the  probability  limits  (45). 

As  an  example,  consider  the  is  Figure  9  (b) .  Figure  16 
gives  the  probability  limits  under  various  null  hypotheses  H^.  The 
higher  order  crossings  plot  lies  completely  within  the  limits  (45) 
for  a  first  order  autoregressive  process  with  parameter  0.5,  as  it 
should,  and  the  hypothesis  is  accepted.  In  all  the  other  cases 
the  hypothesis  is  rejected.  We  conclude  that  the  series  from  which 
the  Dj  were  extracted  oscillates  as  a  first  order  autoregression 
with  the  indicated  parameter. 

For  a  power  study  of  this  test,  as  well  as  a  refinement  of 
(45)  see  [54]. 


5.4.  The  Psi-square  Statistic 

Another  device  which  exploits  the  monotonicity  in  the  is  a 
very  simple  quadratic  form  defined  as  follows.  Define  the  incre¬ 


ments  in  the  D.  by 
J 


\  -  \-l 
(N-1)  -  I), 


k  =  1 


k  =  2,. . . ,K-1 


k  =  K. 


When  N  is  sufficiently  large,  0  i  <  (N-ll  and 


A,  =  N-1 
k 


Let  Viewing  the  as  "frequencies"  in  a  multinomial 


experiment,  we  define  a  );ener.al  similarity  measure  by  [43],  [44  I 


Figure  16.  The  rate  of  Increase  in  in  (b)  resembles  that  of  a 

first  order  autoregressive  process  with  parameter  (j)  =  0.5. 

2 

are  unccjr  rc  1  a  ted  N(0,o  )  random  variables.  N  =  1000.  The  solid 
lines  ari'  tlie  probability  limits  obtained  under  H  . 


The  ip  statistic  can  be  used  in  testing  the  hypothesis  that 

iTij^  =  >  k  =  l,...,K,  or  simply  as  a  distance  measure,  in 

measuring  the  degree  of  similarity  between  time  series. 

The  probability  distribution  of  the  psi-square  statistic  (42) 

has  been  studied  in  [56],  but  no  closed  forms  expressions  are 

available  at  present.  It  has  been  observed  however  that  when  the 

come  from  linear  processes  [45]  of  the  type  presented  in 

2 

Figure  16,  the  tail  of  the  probability  density  of  ip  varies  very 

little  from  process  to  process  [43],  [44]-  In  fact,  computer  simu- 

2 

lations  show  that  in  most  cases  Pr(i|;  >30)  <  0.05,  for  K  =  9  and 

sufficiently  large  N  as  can  also  be  seen  from  Figure  17.  We  can 
2 

use  the  -value  30  as  a  critical  value  which  corresponds  to  a 
significance  level  of  less  than  0.05.  This  means  that  the  proba¬ 
bility  of  rejecting  a  true  hypothesis  is  less  than  0.05. 

As  an  example,  consider  again  the  from  Figure  9  (b) ,  and 
the  four  models  in  Figure  16.  We  obtain,  by  using  (12),  (43),  the 
m^  under  each  of  the  four  hypotheses.  The  Aj  are  computed  from  the 
Dj .  In  case  (a),  the  hypothesis  is  that  of  white  noise  and  with 
K  =  9,  N  =  1000,  we  have 


p  =  57.12  +  26.24  +  49.23  +  ...  +  0.50  >  30 


and  tlie  hypothesis  of  white  noise  is  strongly  rejectc-d.  'I'his  of 

course  is  not  a  surprise  since  the  I).  were  generated  by  a  nonwhite 

2 

noise  process,  and  we  expect  a  large  p  value.  In  ease  (b)  both 


A  density  of  1.847x^.yj  versus  an  empirical  distribution 

2  2 
of  ip  (K  =  9,  N  =  1000)  obtained  from  100  ip  values.  The  underlying 

process  is  a  first  order  autoregression  with  parameter  0.5.  In  100 

2 

(p  values  only  two  exceeded  30,  three  exceeded  29,  four  exceeded  28. 

2 

m  and  A.  come  from  the  same  process  and  we  expect  a  small  ip  . 

.1  J 

2 

Indeed,  we  obtain  ij/  -  7 . 5  <  30  and  the  hypothesis  of  a  first 
order  autoregressive  process  with  parameter  0.5  is  accepted.  For 
the  two  other  cases  the  values  are  over  40  and  over  600,  re¬ 
spectively,  and  the  two  correspond]  ng,  hypotheses  are  rejected.  We 
see  that  evaluating  ip  quantifies  the  resvilts  ol  I),  plots  such  as 
those  in  Figure  16. 

When  the  m,  in  (46)  are  replaci'd  bv  observed  .  from  another 
.1  .1 

process,  p  becomes  a  measure  of  distance.  Suppose  in  the  speech 


■.I  *  w  •  y  •y  ■.»  «ji  ■  f  ••  *.■  ■  V  f Ji  ^  "Ji  mi  VJi  v 

59 


data  the  A. 

J 

two  cases  as 
and  "one"  is 
is,  "one"  is 
well  known. 


from  "zero"  are  taken  as  "expected"  and  A  in  the  other 

J 

2 

observed  quantities.  Then  the  ip  value  from  "zero" 

2 

smaller  than  the  tp  value  from  "zero"  and  "two".  That 
closer  to  "zero"  than  "two",  but  this  is  of  course 


5.5.  Extension  to  Two  Dimensions 

The  Higher  Order  Crossings  Theorem  can  be  extended  to  two  di¬ 
mensional  processes  tj  =  ...,-1,0,1,...,  j=l,2  [57], 

so  that  the  above  methodology  can  be  extended  in  a  natural  way  to 
discrimination  between  pictures  and  images.  I'o  describe  the  de¬ 
generate  state  corresponding  to  (39)  introduce  the  notation  to 
denote  a  difference  with  respect  to  the  index  s. 


Let 

X^’'^tj,t2) 


V^^_Z(s,t)  =  Z(s,t)  -  Z(s-l,t). 

t  )  =  0  unless  '*'7^  ^Z(t,,t,)  >  0, 

z  tj^  t2  J-  ^ 

(k) 

1.  As  k  «>,  (X  f  converges  to 


in  which  case 


.  ..OiOlOl... 
...101010... 
...010101... 
...101010... 
...010101... 


..101010. 


with  probability  1/2  or  to  its  shift,  again  with  probability  1/2. 

(47)  is  the  degenerate  state  analogous  to  (39)  when  (ti.ti)  is  in  the 

spectral  support.  The  higher  order  crossings  are  now  defined  as 

the  total  number  of  horizontal  and  vertical  symbol  changes  in  the 

(k) 

binary  arrays  of  fixed  size  {X  (tj^.t^)}.  If  »  ^2  - 

enjoy  similar  properties  as  in  the  one  dimensional  case.  For  an 

application  of  these  higher  crossings  in  texture  discrimination 


6.  CONCLUDING  REMARKS 


We  have  described  in  some  detail  the  domain  of  zero-crossings 
to  which  we  refer  as  ttie  D-domain  on  account  of  the  higher  order 


crossings  ,D  .  it  is  a  graphical-combinatorial  domain.  In  retro- 
k  Hj 


spect,  several  points  need  further  clasi f ication.  First,  although 
many  of  our  results  depend  on  the  Gaussian  assumption  ,  our  ex¬ 
perience  shows  that  reasonable  deviations  from  Gaussianity  have 
almost  no  bearing  on  our  previous  conclusions.  This  is  due 
chiefly  to  the  dominant  frequency  principle  which  is  independent 
of  any  Gaussian  assumption.  Still,  it  is  of  interest  to  know 
whether  a  parallel  theory  of  zero-crossings  exists  which  disposes 
of  the  Gaussian  assumption.  In  our  opinion  this  is  possible  pro¬ 
vided  there  exists  a  relation  such  as  (2),  but  by  no  means  the 
same,  which  connects  the  zero-crossing  count  or  a  similar  quan¬ 
tity  with  the  spectral  distribution  function.  The  case  we  have  in 
mind  is  the  class  of  bound-.'d  processes. 

It  is  possible  to  oljtain  a  representation  analogous  to  (2) 
but  in  terms  of  tlic  crossings  or  a  random  curve.  Suppose  {Z^}  is 
a  zero  mean  stationary  bounded  process. 


I  I  A  for  a  1 1  t . 

Obviouslv  cannot  be  a  Gaussian  process.  In  tlie  range  (-A,A) 

define  a  uniform,  ciMi>letelv  random,  process  which  consists 

of  independent  random  variables  uniformly  distributed  in  (-A,A). 
For  e.ich  t,  has  thi’  probability  density  1  /  2A  defiiuH!  (i.e., 
positive  tiK're)  over  (-A,A).  Gorrespond  i  ng  to  Z  we  now 


Our  last  point  concerns  a  curious  similarity  between  Rice's 
formula  and  the  zero-crossing  spectral  representation  (2).  The  two 
are  not  too  far  apart  after  all.  To  see  that,  apply  to  the  cosines 
in  (2)  a  second  order  Taylor's  series  expansion  to  obtain  after 
some  cancellation  the  approximation 


ED,  dF((a) 

_ 1  1  ■'-TT _ 

N-J  .T  flT 

I  dt'M 

— 7T 


The  left  hand  side  of  (49)  is  our  definition  of  the  expected  zero¬ 
crossing  rate,  while  the  right  hand  side  has  the  form  of  Rice's 
weil-knov/n  formula  [19,  eq.  3.3-11]  for  the  zero-crossing  rate  when 
expressed  in  terms  of  the  2nd  spectral  moment,  but  modified  to  suit 
the  discrete  time  case. 
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